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Abstract 

The degree of a polynomial representing (or approximating) a function / 
is a lower bound for the quantum query complexity of /. This observation 
has been a source of many lower bounds on quantum algorithms. It has been 
an open problem whether this lower bound is tight. 

We exhibit a function with polynomial degree M and quantum query 
complexity f2(M 1321 - ). This is the first superlinear separation between 
polynomial degree and quantum query complexity. The lower bound is shown 
by a generalized version of the quantum adversary method. 

1 Introduction 

Quantum computing provides speedups for factoring [29], search [15| and many 
related problems. These speedups can be quite surprising. For example, Graver's 
search algorithm fi"5ll solves an arbitrary exhaustive search problem with N possi- 
bilities in time 0(y/N). Classically, it is obvious that time Q(N) would be needed. 

This makes lower bounds particularly important in the quantum world. If we 
can search in time 0(yN), why can we not search in time 0(log c N)l (Among 
other things, that would have meant NP C BQP.) Lower bound of Bennett et al. 
[ 10 1 shows that this is not possible and Graver's algorithm is exactly optimal. 

Currently, we have good lower bounds on the quantum complexity of many 
problems. They mainly follow by two methods 1 : the hybrid/adversary method[ 10, 
4 1 and the polynomials method [9|. The polynomials method is useful for proving 
lower bounds both in classical [23 1 and quantum complexity [9 1. It is known that 
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1. the number of queries Qe(J) needed to compute a Boolean function / by 
an exact quantum algorithm exactly is at least de |^ , where deg(/) is the 
degree of the multilinear polynomial representing /, 

2. the number of queries Q2(f) needed to compute / by a quantum algorithm 

with two-sided error is at least de |^ , where deg(/) is the smallest degree 
of a multilinear polynomial approximating /. 

This reduces proving lower bounds on quantum algorithms to proving lower bounds 
on degree of polynomials. This is a well-studied mathematical problem with meth- 
ods from approximation theory [ 14] available. Quantum lower bounds shown by 
polynomials method include a Q2H) = ^l(^/D(f)) relation for any total Boolean 
function / [9], lower bounds on finding mean and median [22|, collisions and ele- 
ment distinctness E^J. Polynomials method is also a key part of recent £l(\/~N) 
lower bound on set disjointness which resolved a longstanding open problem in 
quantum communication complexity [25|. 

Given the usefulness of polynomials method, it is an important question how 
tight is the polynomials lower bound. (9| [13 ] proved that, for all total Boolean 
functions, Q 2 (f) = 0(deg 6 (/)) and Q E (f) = 0(deg 4 (/)). The second result 
was recently improved to = 0(deg 3 (/)) ETI . Thus, the bound is tight up 

to polynomial factor. 

Even stronger result would be Qe(I) = 0(deg(/)) or Q2(f) = 0(deg(/)). 
Then, determining the quantum complexity would be equivalent to determining 
the degree of a function as a polynomial. It has been an open problem to prove or 
disprove either of these two equalities l9l fT3l . 

In this paper, we show the first provable gap between polynomial degree and 
quantum complexity: deg(/) = 2 d and Q2{f) = Q(2.5 d ). Since deg(/) > 
deg(/) and Qe{I) > Q2(f), this implies a separation both between Qe{I) and 
deg(/) and between Q>z(f) and deg(/). 

To prove the lower bound, we use the quantum adversary method of [4]. The 
quantum adversary method runs a quantum algorithm on different inputs from 
some set. If every input in this set can be changed in many different ways so 
that the value of the function changes, many queries are needed. 

The previously known version of quantum adversary method gives a weaker 
lower bound of Q 2 (f) = 0(2. 1213. .. d ). While this already gives some gap be- 
tween polynomial degree and quantum complexity, we can achieve a larger gap by 
using a new, more general version of the method. 

The new component is that we carry out this argument in a very general way. 
We assign individual weights to every pair of inputs and distribute each weight 
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among the two inputs in an arbitrary way. This allows us to obtain better bounds 
than with the previous versions of the quantum adversary method. 

We apply the new lower bound theorem to three functions for which determin- 
istic query complexity is significantly higher than polynomial degree. The result is 
that, for all of those functions, quantum query complexity is higher than polyno- 
mial degree. The biggest gap is polynomial degree 2 d = M and query complexity 
tt(2.5 d ) = n(M 1 - 321 -). 

Spalek and Szegedy [32] have recently shown that our method is equivalent to 
two other methods, the spectral method of [ 8 1 that was known prior to our work 
and the Kolmogorov complexity method of [ 19 1 that appeared after the conference 
version of our paper was published. Although all three methods are equivalent, 
they have different intuition. It appears to us that our method is the easiest to use 
for results in this paper. 

2 Preliminaries 

2.1 Quantum query algorithms 

Let [N] denote {1,...,N}. 

We consider computing a Boolean function f(x\, . . . , xn) '■ {0, 1}^ — ► {0, 1} 
in the quantum query model (for a survey on query model, see [6| El)- In this 
model, the input bits can be accessed by queries to an oracle X and the complexity 
of / is the number of queries needed to compute /. A quantum computation with 
T queries is just a sequence of unitary transformations 

Uo^O^Ui^O^...^ U T -i ^O^U t . 

The Uj's can be arbitrary unitary transformations that do not depend on the 
input bits x%, . . . , x/v- The O's are query (oracle) transformations which depend 
on x\, . . . , xn. To define O, we represent basis states as \i,z) where i consists of 
[log(A r + 1)] bits and z consists of all other bits. Then, O x maps |0, z) to itself 
and \i,z) to (—l) Xi \i,z) for i G {1, —,N} (i.e., we change phase depending on 
Xi, unless i = in which case we do nothing). 

The computation starts with a state |0). Then, we apply Uq, O x , . . ., O x , Ut 
and measure the final state. The result of the computation is the rightmost bit of 
the state obtained by the measurement. 

The quantum computation computes / exactly if, for every x = (x±, . . . , xn), 
the rightmost bit of UtO x . . . O x Uo\0) equals f(x\, . . . , xn) with certainty. 

The quantum computation computes / with bounded error if, for every x = 
(x\, . . . ,xn), the probability that the rightmost bit of UtO x Ut-i ■ ■ ■ O x Uo\0) 
equals f(x\, . . . ,xn) is at least 1 — e for some fixed e < 1/2. 
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Qe(/) (Q2C/O) denotes the minimum number T of queries in a quantum algo- 
rithm that computes / exactly (with bounded error). D(f) denotes the minimum 
number of queries in a deterministic query algorithm computing /. 

2.2 Polynomial degree and related quantities 

For any Boolean function /, there is a unique multilinear polynomial g such that 
f(x±, . . . ,xn) = g{x\, ■ ■ ■ ,xn) f° r a U x li ■ ■ ■ i x n £ {0, 1}. We say that g repre- 
sents f. Let deg(/) denote the degree of polynomial representing /. 

A polynomial g(x\, . . . , xn) approximates / if 1 — e < g(x\, . . . ,xn) < 1 
whenever /(xi, . . . ,xjv) = 1 andO < g(x\, . . . ,xn) < e whenever /(xi, . . . , xn) = 
0. Let deg(/) denote the minimum degree of a polynomial approximating /. It is 
known that 

Theorem 1 R% 

1. Q E (f) = n(deg(/)); 

2. Q 2 (f) = n(deg(f)); 

This theorem has been a source of many lower bounds on quantum algorithms 

(9l|22l|2l. 

Two other relevant quantities are sensitivity and block sensitivity. The sensi- 
tivity of / on input x = (xi, . . . , xjq) is just the number of i G [N] such that 
changing the value of x\ changes the value of /: 

f{xi, X N ) / f(xi, Xi-l, 1 - Xi,X i+ i, xn). 

We denote it s x (f). The sensitivity of / is the maximum of s x (f) over all x G 
{0,1}^. We denote its(/). 

The block sensitivity is a similar quantity in which we flip sets of variables 
instead of single variables. For x = {x\, . . . ,xn) and S C [N], let x^ be the 
input y in which yi = x, if i £ S and yi = 1 — x» if i G S. The block sensitivity 
of / on an input x (denoted bs x (f )) is the maximum number k of pairwise disjoint 
Si, ■ ■ ., Sj. such that f(x^ Si ^) ^ f{x). The block sensitivity of / is the maximum 
of bs x {f) over all x G {0, 1}^. We denote it bs(f). 

3 Main results 

3.1 Overview 

The basis function. f(x) is equal to 1 iff x = X1X2X3X4 is one of the following 
values: 0011, 0100, 0101, 0111, 1000, 1010, 1011, 1100. This function has the 
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degree of 2, as witnessed by polynomial f(x\, x%, X3, X4) = x\ + x 2 + X3X4 — 
X1X4 — X2X3 — X1X2 and the deterministic complexity D(f) = 3, as shown in 
section l4~3l where we discuss the function in more detail. 

Iterated function. Define a sequence f 1 = f, f 2 , . . . with f d being a function of 
4 d variables by 

f d+1 = f(f d ( x l, • • • ) %4<l), f^ix^d+i, . . . , X 2 .4d), 
/ rf ( x 2-4 d +l' • • • > x 3-4 d ); f d ( x 3-4 d +ly • ■ • ; 2 ; 4 d + 1 ))- (1) 

Then, cfe#(.f d ) = 2 d , L>(/ d ) = 3 d and, on every input x, s x (f d ) = 2 d and 

bs x (f d ) = 3 d - 
We will show 

Theorem 2 Q 2 {f d ) = tt(2.5 d ). 

Thus, the exact degree is deg(f d ) = 2 d but even the quantum complexity 
with 2-sided error Q 2 (f d ) is (1(2. 5 d ) = deg(f d ) x - 2,21 -\ This implies an M-vs.- 
(l(M 2 ) gap both between exact degree and exact quantum complexity and 
between approximate degree and bounded-error quantum complexity. 

The proof is by introducing a combinatorial quantity Q' 2 {f) with the following 
properties: 

Lemma 1 For any Boolean function g, Q 2 (g) = (L(Q' 2 (g))- 

Lemma 2 Let g be an arbitrary Boolean function. If g 1 , g 2 , ... is obtained by 
iterating g as in equation (0, then 

Q'2(g d ) > (Q'2(g)) d - 

Lemma 3 Q' 2 (f) > 2.5. 

Theorem |2] then follows from Lemmas [0 El |3] 

3.2 Previous methods 

Our approach is a generalization of the quantum adversary method [4|. 

Theorem 3 MS Let A C {0, 1}^, B C {0, 1}^, R C Ax B be such that f(A) = 
0, f(B) = 1 an J 

• /or every x £ A, there are at least m inputs y € B such that (x,y) £ R, 

• for every y & B, there are at least m' inputs x E A such that (x,y) £ R, 
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• for every x = (x\ . . . xjy) G A and every i £ [N] there are at most I inputs 
y £ B such that (x, y) £ R and £j 7^ yi, 

• for every y = (yi . . . yjv) € B and every i G [N], there are at most I' inputs 
x £ A such that (2, y) G R and Xi 7^ y^. 

Then, Q 2 (f) = 

There are several ways to apply this theorem to f d defined in the previous 
section. The best lower bound that can be obtained by it seems to be Q2U) = 
Q(2. 1213. . d ) (cf. appendix . This gives some separation between Q2H) and 
deg(f) = 2 d but is weaker than our new method that we introduce next. 

3.3 New method: weight schemes 

We now formally define the combinatorial quantity Q' 2 (/) that we use in Lemmas 
□ |2]and|3] 

Definition 1 Let f : {0, 1}^ -» {0, 1}, A C B Q /"H 1 ) and R O A x 

B. A weight scheme for A, B, R consists of numbers w(x, y) > 0, w'(x, y, i) > 0, 
w'(y, x, i) > Ofor all (x, y) G R and i G [N] satisfying X{ 7^ yi, we have 

w'(x,y,i)w'{y,x,i) >w 2 (x,y). (2) 
Definition 2 The weight of x is wt(x) = 2~2y.(x,y)£R w (. x >y)> if x £ A and 

Definition 3 Let i £ [N]. The load of variable X{ in assignment x is 

v{x,i) = w \ x ^y^) 

if x £ A and 

v(x,i) = w'{x,y,i) 

y:(y,x)&R,x t ^y t 

ifx £ B. 

We are interested in schemes in which the load of each variable is small com- 
pared to the weight of x. 

Let the maximum A-load be va = max^g^ ,ie[N] V wt(x) • ^ et tne max i mum 
B-load be vb = max xe B,ie[N] V wt(x) ■ ^ ne max i mum load of a weight scheme is 

Vmax = y/VAVB- 
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Let Q^if) be the maximum of over all choices of A C {0,1} ,B C 
{0,1} N , R Q Ax B and all weight schemes for A, B, R. We will show in Lemma 
[fl if we have a weight scheme with maximum load v max , the query complexity has 
tobefi(^). 

u max 

3.4 Relation to other methods 

Theorem[3]follows from our new Lemma^if we set w(x, y) = 1 for all (x, y) G R 
and w(x, y, i) = w(y, x, i) = 1 for all i G [N]. Then, the weight of x is just the 
number of pairs (x, y) G R. Therefore, wt(x) > m for all x G A and u>i(y) > m' 
for all y G -B. The load of z in x is just the number of (x, y) G R such that 
Xj / yi. That is, f(x,i) < I and v(y,i) <• Z'. Therefore, < w^j < ^7 and 

u m(B < y^^7- This gives us the lower bound of Theorem |3] 

There are several generalizations of Theorem |3] that have been proposed. Bar- 
num and Saks [7| have a generalization of Theorem |3] that they use to prove a 
Q,{yfN) lower bound for any read-once function on N variables. This generaliza- 
tion can be shown to be a particular case of our Lemma ^ with a weight scheme 
constructed in a certain way. 

Barnum, Saks and Szegedy [8| have a very general and promising approach. 
They reduce quantum query complexity to semidefinite programming and show 
that a t-query algorithm exists if and only if a certain semidefinite program does not 
have a solution. Spalek and Szegedy have recently shown [ 32 1 that our weighted 
scheme method is equivalent to Theorem 4 in [ 8 1 which is a special case of their 
general method. Our method is also equivalent [32| to Kolmogorov complexity 
method by Laplante and Magniez [ 19 1. 

Hoyer, Neerbek and Shi [16| have shown lower bounds for ordered searching 
and sorting using a weighted version of the quantum adversary method, before both 
this paper and [8 1. Their argument can be described as a weight scheme for those 
problems, but it is more natural to think about it in the spectral terminology of [8 1. 

4 Proofs 
4.1 Lemma H] 

In terms of weights schemes, Lemma^becomes 

Lemma [D If a function g has a weight scheme with maximum load v max , then 

Proof: We can assume that va = vb = v max . Otherwise, we just multiply all 

w'(x, y, i) by \Jvb/va and all w'(y, x, i) by \/vaJvb- Notice that this does not 
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affect the requirement ©. In the new scheme va is equal to the old va\/vb/va = 
y/VAVB = v max and v B is equal to the old v B \/v a /vb = ^/vavb = v max - 

Let |^*) be the state of a quantum algorithm after t queries on input x. We 
consider 

w t = w(?>v)\y>M)\- 

(x,y)eR 

For t = 0, Wo = J2(x y)£R w(x, y). Furthermore, if an algorithm computes / in t 
queries with probability at least 1 — e, Wt < 2 a/ e(l — eJWo |4]ED- To prove that 
T = il( ), it suffices to show 

Umax ' 

Lemma 4 — < 2v max Wo- 

Proof: Let \4> x ) be the state of the algorithm immediately before query t. We write 

TV 

with \4>' xi ) being the state of qubits not involved in the query. The state after the 
query is 

TV 

i^) = <oio)i4, )+E<i(-irN)i^)- 
i=i 

Notice that all the terms in {4> x \4>y) and (ip x \ipy) are the same, except for those 
which have / y^. Thus, 

(^1^) - (41^) < 2 E la^lKil 

and 

\w J -w^ 1 \<2 E 

By the inequality < A 2 + B 2 , 

IWi-wG-il^ E E K(^y>*)Kd 2 +™W,*)Kil 2 )- 

(:r,j/)ei? 2:2,7% 

We consider the sum of all first and all second terms separately. The sum of all first 
terms is 

E E w '( x ^^)\ at x,i\ 2 = E l«y 2 E w'(x,y,i)\ 

(x,y)£Ri:xi^yi x£A,ie[N] \y-(x,y)£R,Xiy£yi / 
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= \ a i,i\ 2 ' v ( x > i ) < v a \ a l,i\ 2wt ( x ) 

x<=A,i£{N] x£A,i£[N] 

= v A Y wt ( x ) Y \ a x,i\ 2 = v aY wt ( x ) = v aW . 
xeA ie[N] xeA 

Similarly, the second sum is at most vbWq. Finally, va = vb = v max implies that 

\Wj-Wj- X \ <2v max W . 

4.2 Lemma El 

In terms of weight schemes, we have to prove 

Lemma|2]L£? g be a function with a weight scheme with maximum load v\. Then, 
the function g d obtained by iterating g as in equation (0 has a weight scheme with 
maximum load vf. 

The lemma follows by inductively applying 

Lemma 5 If g has a weight scheme with maximum load v\ and g d ~ l has a weight 
scheme with maximum load v^-i, then g d has a weight scheme with maximum load 

VlV d -l. 

Proof: Similarly to lemma [0 assume that the schemes for g and g d_1 have va = 

VB = V max . 

Let n be the number of variables for the base function g(x\, . . . ,x n ). We 
subdivide the n d variables x\, . . ., x n d of the function g d into n blocks of n d ~ x 
variables. Let x 3 = • . ., Xj n d-i) be the j th block. Furthermore, let 

x be the vector 

(/- 1 (x 1 ),/- 1 (x 2 ),.../- 1 (x")). 

Then, g d {x) = g(x). 

We start by denning A, B and R. Let At, B\, R\ (A2-1, #<z-l> Rd-i) be A, 
B, R in the weight scheme for g (g^ 1 , respectively), x G A (B, respectively) if 

• x G A\ (B\, respectively), and 

• for every j G [n], x 3 G A2-1 if Xj = and x 3 G B^-i if % = 1. 
(x, y) G R if (x, y) G R\ and, for every j G [n], 

• x 3 = y 3 if Xj = y~j. 

• (x 3 , y 3 ) G Rd-i if Xj = 0, yj = 1. 

• (y j , x 3 ) G Rd-i if xj = 1, yj = 0. 
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Let w\ (x, y) denote the weights in the scheme for g and w d -i(x, y) the weights 
in the scheme for g^" 1 . We define the weights for g d as 

Wd(x,y) = wx(x,y) wt d -i{x j ) J| w d - 1 (x j ,y j ) 

where wt d _i is the weight of x-> in the scheme for g l . 

For i G [n ], let i\ = [-^rr] be the index of the block containing i and 
i% = (i — 1) mod n d ~ l + 1 be the index of i within this block. Define 



w' d (x,y,i) = Wd(x,y)i 



lw[(x,y,ii) 



w' d _ 1 (x il ,y il ,i 2 ) 
w' 1 {y,x,i 1 )\ w' d _ 1 (y il ,x il ,i 2 )' 



The requirement $2% is obviously satisfied. It remains to show that the maxi- 
mum load is at most v\v d -\. We start by calculating the total weight wt d (x). First, 
split the sum of all w d (x, y) into sums of w d (x, y) over y with a fixed z = y. 

Claim 1 



E w d (x,y) =w 1 (x,z)Y[wt d _ 1 (x J ). 



ye{0,l}" d :y=z 

Proof: Let y be such that y = z. Then, 



3=1 



w d {x,y) = wi(x,z) Y[ wt d-i{x 3 ) w d _i(x j ,y j ) 

j:Xj=Zj j-.Xjy^Zj 

When xj / zj, y 3 can be equal to any y' G {0, l}" d 1 such that g d ~ l {y') = zj. 
Therefore, the sum of all w d (x, y), y = z is 



u>i ^' z ) n n 



/ 



E 



r-Xj= z 3 r-xj¥=zj \y'e{0,l}" d - 1 ■.g d ~ 1 (y')=z j 

Each of sums in brackets is equal to wt d _i(x 3 ). Therefore, (|3} equals 



(3) 



wi(x,z) Y[ wt d ^i{x j 



Corollary 1 



3=1 



wt d (x) = wti(x) Y[ wtd~i{x j ). 

3=1 



(4) 
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Proof: wtd(x) is the sum of sums from Claim^over all z 6 {0, l} n . Now, the 
corollary follows from Claim[l]and J2ze{o,i} n w i{ x i z ) = wti(x) (which is just 
the definition of wt\(x)). 

Next, we calculate the load 



v(x, %) 



E 

j/6{0,l}' ld 



w' d {x,y,i) 



in a similar way. We start by fixing z = y and all variables in y outside the if 1 
block. Let W be the sum of Wd(x, y) and V be the sum of w 'Ax, y, i), over y that 
have y = z and the given values of variables outside y %1 . 

Claim 2 



V<v d J^p^\w. 
V w[(y,x,ii) 

Proof: Fixing z and the variables outside y 11 fixes all terms in Wd{x,y), except 
Wd-i{x 11 , y* 1 )- Therefore, w d (x,y) = Cwd-i(x l1 , y' L1 ) where C is fixed. This 
means W = Cwt d ~\{x l1 ). Also, 



w' d (x,y,i) = Cwd-i(x n ,y n ) 



w 



d _ 1 (x i \y i \i 2 ) w' x (x,y,h) 



\ w' d _ 1 (y il ,x il ,i 2 ) V w'i{y,x,h) 
Property (O of the scheme for (A d -i, B d -i, Rd-i) implies 



w d -i(x h ,y ll )> 



w' d _ l (x i \y i \i 2 ) 
\ w' d _ 1 (y h ,x il ,i 2 ) 



<^_ 1 (^ 1 ,y il ,i 2 ), 



w d (x,y,i) < Cw d ^{x \y\i 2 )\ _ ■ 

If we sum over all possible y l1 G {0, l} nd 1 , we get 



V <Cv d ^{x l \i 2 )^ 



iw'i(x,y,ii) 



Since t>d-i(x 11 , 22) < i^-iu^-i^ 11 )' we nave 



wi(y,x,h) 



n 



We now consider the part of v(x, i) generated by w' d (x, y, i) with a fixed y. By 

/ 11)^ (x XI % i 1 

the argument above, it is at most i^-iy Ji^^^) times the sum of corresponding 

Wd(x, y). By Claim[T] this sum is wi(x, z) Ilj=i w td-l{ x ^)- By summing over all 
y, we get 



v(x,i) < V v d -iJ W } X,Z _ ,n wi(x,z) TT wtd-iix 1 ) 

zmiv M(wi) 



i= l 2G {0,1}' 1 V w^x^x) 



By property ©, * / z) < w[(x, z, h). Therefore, 



El U) 'i [X j 2> j 1^ ) . ^ — > . . . . , . 

, V it)i (z, x, i] , ' 

ze{o,i} n » ze{o,i} n 

and is at most 

i=l 

By induction, Vd < (vi) d . This proves lemma|2 
4.3 Lemma 13 

We now look at the base function / in more detail. The function / is shown in 
Figure[l] The vertices of the two cubes correspond to (xi,X2,x^,X4) G {0, l} 4 . 
Black circles indicate that f(x±, X2, X3, x&) = 1. Thick lines connect pairs of 
black vertices that are adjacent (i.e., x\xix$x± and 2/12/22/32/4 differing in exactly 
one variable with f(x 1 ,x 2 ,x 3 , 24) = 1 and f(y x , y 2 , 2/3,2/4) = !)• 

From the figure, we can observe several properties. Each black vertex (/ = 
1) has exactly two black neighbors and two white neighbors. Each white vertex 
(/ = 0) also has two white and two black neighbors. Thus, for every x G {0, 1} , 
there are two variables Xi such that changing xi changes fix). We call these two 
sensitive variables and the other two insensitive. From figure^we also see that, for 
any x £ {0, l} 4 , flipping both sensitive variables changes f(x) and flipping both 
insensitive variables also changes f(x). 



12 




A 





V 




x,=0 



x,=l 



Figure 1 : The function / 



Thus, the sensitivity of / is 2 on every input. The block sensitivity is 3 on 
every input, with each of the two sensitive variables being one block and the two 
insensitive variables together forming the third block. 

Finally, D(f) = 3. The algorithm queries x\ and X3. After both of those are 
known, the function depends only on one of x% and X4 and only one more query is 
needed. The lower bound follows from bs(f) = 3. 

We now proceed to proving the lemma. In terms of weight schemes, the lemma 

is 

Lemma|3]T7je function f has a weight scheme with the maximum load of 2.5. 

Proof: Let A = / _1 (0), B = R consists of all (x, y) where x G A and y 

differs from x in exactly 

• one of the sensitive variables or 

• both sensitive variables or 

• both insensitive variables. 

Thus, for every x G A, there are four inputs y G B such that (x, y) G R. Also, 
for every y G B, there are four inputs x G A such that (x, y) G R and again, 
these are x differing from y in one sensitive variable, both sensitive variables or 
both insensitive variables. Notice that, if y differs from x in both variables that 
are insensitive for x, then those variables are sensitive for y and conversely. (By 
flipping one of them in y, we get to an input z which differs from x in the other 
variable insensitive to x. Since the variable is insensitive for x, f(x) = f(z). 
Together with f(x) / f(y), this implies f(y) ^ f(z).) 
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Let w(x,y) = 1 for (x,y) G R with x, y differing in one variable and 
w(x, y) = 2/3 if x, y differ in two variables. Thus, wt(x) = 2 • 1 + 2 • | = ^ for 
all x. w'(x, y, i) is 

• 1 if x and y differ in one variable, 

• | if they differ in both variables sensitive for x, 

• | if they differ in both variables insensitive for x. 

Since = . this is a correct weight scheme. 
We now calculate the load of i. There are two cases. 

1. x is insensitive to flipping X{. Then, the only input y such that (x,y) G R 
and Xi ^ yi is obtained by flipping both insensitive variables. It contributes 

| to v(x, i). 

2. x is sensitive to flipping Xj. Then, there are two inputs y: one obtained by 
flipping just this variable and one obtained by flipping both sensitive vari- 
ables. The load is v(x, i) = 1 + | = |. 

Thus, we get ^Ppl = ^ = 2.5 for all x, i. 
4.4 Theorem 2 

Theorem 2 now follows from Lemmas ^ El El By Lemma El the function / has 
a weight scheme with the maximum load of 2.5. Together with Lemma El this 
implies that f d has a weight scheme with the maximum load of 2.5 d . By Lemma 
[U this means that Q 2 (/) = n(2.5 d ). 

5 Other base functions 

Iterated functions similar to ours have been studied before. Nisan and Wigderson 
ll2*4l used them to show a gap between communication complexity and log rank 
(an algebraic quantity that provides a lower bound on communication complexity). 
Buhrman and de Wolf [ 13 1 proposed to study the functions from [ 24 1 to find out if 
polynomial degree of a function characterizes its quantum complexity. However, 
the base functions that [24, 131 considered are different from ours. 

We now consider the functions from l24l [T3l . Our method shows the gaps 
between deg(/) and Q2(f) for those functions as well but those gaps are consid- 
erably smaller than for our new base function. 
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Function 1 1 23 , 24|. g(x\,X2,X3) is iff all variables are equal. We have 
deg(g) = 2 (as witnessed by g = x\ + X2 + £3 — x\X2 — x\Xz — X2X3), and 
D(g) = 3. 

Lemma 6 g has a weight scheme with max load y/2/3. 

Proof: Let A = g' 1 ^), B = g~ x {l), R = A x B. We set w(x,y) = 2 if 
x, y differ in one variable and w(x, y) = 1 if x and y differ into two variables. 
(Notice that x and y cannot differ in all three variables because that would imply 
g(x) = g{y).) 

The total weight wt(x) is 

• 3-2 + 3-1 = 9 for x *E A (since there are three ways to choose one variable 
and three ways to choose two variables and every way of flipping one or two 
variables changes the value). 

• 2 + 1 = 3 for x *E B. (Each such x has two variables equal and third 
different. It is involved in w(y,x) with y obtained by flipping either the 
different variable or both equal variables.) 

Let x G A, y G B. If x, y differ in one variable xi, we define w'(x, y, i) = 2\/2 
and w'(y,x,i) = y/2. If x, y differ in two variables, w'(x,y,i) = V2/2 and 
w'(y, x, i) = v2 for each of those variables. 

The load of i in x is: 

1. g(x) = 0. 

We have to add up w'(x, y, i) with y differing from x either in xi only or in 
Xi and one of other two variables. We get 2^/2 + 2 • (y/2/2) = 3^/2. 

2. g(x) = 1. 

Then, there is only one input y. It can differ in just xi or x% and one more 
variable. In both cases, w'(x, y, i) = y/2. 

We have v A = = ^ and v B = ^. Therefore, v max = 
This means that Q 2 {g d ) = ^((4j) d ) = ^(2.12.. d ). 

Function 2 (Kushilevitz, quoted in |24|). The function h(x) of 6 variables is 
defined by 

• h(x) = if the number of xi = 1 is 0, 4 or 5, 

• = 1 if the number of X{ = 1 is 1, 2 or 6, 
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• if the number of Xj = 1 is 3, h(x) = in the following cases: x\ = xi = 

X3 = 1, X2 = X3 = X4 = 1, X3 = X4 = X5 = 1, X4 = X5 = Xi = 1, X5 = 
Xi = X 2 = 1, Xi = X 3 = X 6 = 1, Xi = X 4 = X 6 = 1, X 2 = X 4 = X 6 = 1, 

X2 = £5 = xq = 1, X3 = X5 = xq = 1 and 1 otherwise. 
We have deg(h) = 3 and D(h) = 6. 

Lemma 7 h has a weight scheme with max load 4/ \/39. 

Proof: We choose A to consist of inputs x with all Xj = and those inputs x with 
three variables Xj = 1 which have h(x) = 0. B consists of all inputs x with exactly 
one variable equal to 1. R consists of (x, y) such that y can be obtained from x by 
flipping one variable if x = 6 and two variables if x contains three Xj. 

If x = 6 and y G we set -u;(x, y) = w'(x, y, i) = w'(y, x, i) = 1. 

If x has three variables x« = 1 and y is obtained by switching two of those to 
0, we set w(x, y) = 1/8, w'(x, y,i) = and w'(y, x, i) = \. 

To calculate the maximum loads, we consider three cases: 

1. x = 6 . 

wt(x) = 6 and v(x, i) = 1 for all i. 

2. x has three variables x« = 1. 

Then, there are three pairs of variables that we can flip to get to y € B. Thus, 
wt(x) = 3/8. Each x\ = 1 gets flipped in two of those pairs. Therefore, its 
load is v(x, i) = 2 • 1/32 = 1/16. The ratio ^ is 6. 

3. y has 1 variable yi = 1. 

Then, we can either flip this variable or one of 5 pairs of yi = variables to 
get to 1 6 A. The weight is wt(y) = 1 + 5 ■ | = ^. If = 1, then the only 
input x <E A, (x, y) e R with Xj / is x = 6 with x, i) = 1. Thus, 
v(y, i) = 1. If yi = 0, then exactly two of 5 pairs of variables j : yj = 
include the i th variable. Therefore, v(y, i) = 2 ■ \ = 1. 

Thus, t>A = 1/6, t>s = 8/13 and t> ma:E = 2/\/39. 

This gives a 3 d vs. 0((v / 39/2) d ) = 0(3.12. .. fl! ) gap between polynomial de- 
gree and quantum complexity. 

6 Conclusion 

An immediate open problem is to improve our quantum lower bounds or to find 
quantum algorithms for our iterated functions that are better than classical by more 
than a constant factor. Some other related open problems are: 
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1. AND-ORtree. Let 



f(xi, . . . , x 4 ) = (x\ A x 2 ) V (x 3 A a? 4 ). 

We then iterate / and obtain a function of N = 4 n variables that can be 
described by a complete binary tree of depth log 2 N = 2n. The leaves of 
this tree correspond to variables. At each non-leaf node, we take the AND 
of two values at its two children nodes at even levels and OR of two values 
at odd levels. The value of the function is the value that we get at the root. 
Classically, any deterministic algorithm has to query all N = 4 n variables. 
For probabilistic algorithms, A r °' 753 -- = (i+^M)2n q Uer j es are sufficient 
and necessary [ 26 27 30 1 . What is the quantum complexity of this problem? 
No quantum algorithm that uses less than TV 753 -- = (I+^ZM) 2 ™ queries is 
known but the best quantum lower bound is just Q(N - 5 ) = 17(2"). 

A related problem that has been recently resolved concerns AND-OR trees 
of constant depth. There, we have a similar A^-ary tree of depth d. Then, 
&(VN) quantum queries are sufficient ITTI 17711 and necessary |3I|7j]. The 
big-0 constant depends on d and the number of queries in the quantum algo- 
rithm is no longer 0(VN) if the number of levels is non-constant. Curiously, 
it is not known whether the polynomial degree is @(y~N), even for d = 2 

ED. 

2. Certificate complexity barrier. Let Co(/) and C\(f) be 0-certificate and 
1-certificate complexity of / (cf. [13] for definition). Any lower bound 
following from theorems of |4) or weight schemes of the present paper is 
O(y/C (f)Ci(f)) for total functions and O(Vmin(C (/),Ci(/))iV) for 
partial functions 2 l fT9ll33l . 

This has been sufficient to prove tight bounds for many functions. How- 
ever, in some cases quantum complexity is (or seems to be) higher. For 
example, the binary AND-OR tree described above has Co(/) = C\(f) = 
2 n . Thus, improving the known Q(2 n ) lower bound requires going above 

\ n,i/)r, (/). 

To our knowledge, there is only one known lower bound for a total function 
which is better than \J Co(/)Ci(/) (and no lower bounds for partial func- 
tions better than ^mm(C (/), GjJfjN). This is the n(N 2 / 3 ) lower bound 

2 The distinction between partial and total functions is essential here. The methods of |4| and 
the present paper can be used to prove lower bounds for partial functions that are more than 
\l Co(/)C'i(/) but 0(y // min(Co(/), Ci(f))N). Examples are inverting a permutation (4] and 
local search Q. 
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of Shi [2 18 5 1 for element distinctness, a problem which has Co(/) = 2, 
d(/) = iV and VC (/)Ci(/) = 9(ViV). It uses methods quite specific 
to the particular problem and cannot be easily applied to other problems. 
It would be very interesting to develop more methods of proving quantum 
lower bounds higher than 0{\JCo{f)C\{f)) for total functions or higher 
than O( v /min(C (/),Ci(/))A^) for partial functions. 

3. Finding triangles. A very simple problem for which its true quantum com- 
plexity seems to exceed the Q(y / Co(f)Ci(f)) lower bound is as follows. 
We have n 2 variables describing adjacency matrix of a graph. We would like 
to know if the graph contains a triangle. The best quantum algorithm needs 
0(ra 13 ) queries ll3Tll20l an Q(n) lower bound follows by a reduction from 
the lower bound on Graver's search [ 12] or lower bound theorem of [4]. We 
have Co(/) = 0(n 2 ) but C\{f) = 3 (if there is a triangle, its three edges 
form a 1 -certificate), thus Q(n) is the best lower bound that follows from 
theorems in EJ. We believe that the quantum complexity of this problem 
is more than 0(n). Proving that could produce new methods applicable to 
other problems where quantum complexity is more than 0(^Co{f )Ci(f)) 
as well. 

Acknowledgments. Thanks to Scott Aaronson, Yaoyun Shi and Ronald de 
Wolf for their comments about earlier versions of this paper. 
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A Appendix: bounds using previous method 

In this section, we look at what bounds can be obtained for Q2(f d ) for f d defined 
in section ITTI using the previously known lower bound Theorem|5] 

It can be verified that the block sensitivity of / is 3 on every input. By induc- 
tion, we can show that this implies bs x (f d ) = 3 d for every input x G {0, l} 4 . 
This makes it tempting to guess that we can achieve m = ml = 3 d and I = I' = 1 
which would give a lower bound of Cl(3 d ). 

This is not the case. If we would like to use Theorem [3] with I = I' = 1, we 
need two requirements simultaneously: 

1. For every x G A, denote by y%, . . . , y 3 d the elements of B for which (x, y,) G 
R. Then, the sets of variables where (x, yi) and (x, yj) differ must be disjoint 
for all i,j, i ^ j. 

2. For every y G B, denote by x\, . . . , x 3 d the elements of A for which (x{ , y) G 
R. Then, the sets of variables where (xj, y) and (xj,y) differ must be dis- 
joint for all i, j, i ^ j. 

If block sensitivity is 3 d on every input, we can guarantee the first requirement 
(by starting with x G A constructing disjoint Si, ■ ■ ., S 3 d and putting (x, x Si ) into 
R). But, if the set A only contains one x, then ml = 1 and the lower bound is 
^(Vs^) which is even worse than the previous one. 

Therefore, we have to take larger set A. This can break the second requirement. 
Let x, z G A and y G B. Then, we could have (x, y) G R and (z, y) G R. x and 
y would differ in a set of variables Si which is one of 3 d disjoint blocks for x. 
Similarly, z and y would differ in a set Tj which is one of 3 d disjoint blocks for 
z. Now, there is no reason why Si and Tj have to be disjoint! Block sensitivity 
guarantees that Si HSj = for every fixed x but it gives no guarantees about blocks 
for x being disjoint from blocks for z. 

Similarly, if we start with y G B, we can ensure the second requirement but 
not the first. 
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The best that we could achieve with this approach was m = m! = 3 d , I = 1, 
I' = 2 d , as follows. Let A = / -1 (0), B = We inductively construct two 

sets of 3 d disjoint perfect matchings between inputs in A and inputs in B. 

The first set of matchings consists of ordered pairs (x, y), x G A, y G B. For 
d = 1, the first two matchings match each input i G Ato the two inputs y £ B that 
differ in exactly one variable. The first matching is (0011,0001), (0101,1101), 
(1100, 1110), (1010, 0010), (0100, 1100), (1000, 0000), (0111, 1111), (1011, 1001) 
The second matching matches each x G A to the other y G B which differs in ex- 
actly one variable. The third matching matches each x G A to y G B which differs 
from x in both variables that are sensitive for x. This is the first set of 3 matchings. 

The second set of matchings consists of ordered pairs (y,x), y G B,x G A. 
The first two matchings are the same as in the first set. The third matching matches 
each x G A to y G B which differs from x in both variables that are sensitive for 

y- 

For d > 1, we introduce notation and x similarly to section I4~2l 

The first 3 d_1 matchings are constructed as follows. For each x, we find x. Then, 
we find y such that (x, y) belongs to the first matching in the first set. Let i be the 
variable for which X{ ^ y~i. In the k th matching (1 < k < 3 d_1 ), we match each 
x G A to y G B which is defined as follows: 

• If j ^ i, then x J = yK 

• x i is such that (x l , y % ) belongs to the k th matching for d — 1 levels (taking 
matchings from the first set if f(x % ) = and the second set if f(y l ) = 1). 

The second 3 d_1 matchings are constructed similarly, except that we use y for 
which (x, y) belongs to the second matching of the first set. 

To construct the last 3 d_1 matchings, we take y for which (x, y) belongs to the 
third matching. In 2 x 3 d + k th matching, we match x with y defined as follows. 

• if Xi / yi, then y % is the input of length x % for which {x 1 , y l ) belongs to the 
k th matching for d — 1 levels. 

• if Xi = yi, then y % = x l . 

We then define R as the set of (x, y) which belong to one of the 3 d matchings 
we constructed. By induction, we show 

Lemma 8 For the first set of 3 d matchings, m = m! = 3 d , I = 1, I' = 2 d . For the 
second set of 3 d matchings, m = m' = 3 d , V = 1, / = 2 d . 

Proof: First, we prove m = m! = 3 d . In the base case, we can just check that the 
matchings are distinct and, thus, every x G A or y G B is matched to 3 distinct 
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elements of the other set. In the inductive case, consider an element x G A (or 
y G B) and two elements y\ G B and 2/2 £ -B to which it is matched. If (x, yi) and 
(a;, 2/2) belong to two matchings in the same group of 3 rf_1 matchings, then, by the 
inductive assumption y\ 7^ y\ and, hence, y\ 7^ y 2 . If (x, yi) and (x,y 2 ) belong to 
two matchings in different groups, then y~\ 7^ y~ 2 implies y\ 7^ y 2 . 

To prove Z = 1 (or V = 1 for the second set), we first observe that this is true 
in the base case. For the inductive case, we again have two cases. If (x, yi) and 
(x, 2/2) belong to different sets of 3 d_1 matchings, then, for each i G {1, 2, 3, 4}, 
either Xj = yij or i = y 2 j. This means that only one of yi and y 2 can differ 
from x in a variable belonging to x i . If (x, yi) and (x,y 2 ),we apply the inductive 
assumption to (x*,y|) and (x*,y 2 ). 

To prove /' = 2 in the base case, we notice that, if (xi, y) and (x2, y) belong 
to the first and the second matching, then the pairs (xi, y) and (X2, y) cannot differ 
in the same variable. In the inductive case, for every i G {1, 2, 3, 4}, either x~n = 
jji or x~2i = jji- If we have a variable j such that j G {(i — l)4 c( ~ 1 + 1, (i — 
l)4 d— 1 + 2, . . . , i x 4 d_1 } and xij = jji, then (x, y) G R and Xj 7^ yj means 
that (x,y) belongs to either one of the second 3 d_1 matchings or one of the last 
3 rf_1 matchings. By applying the inductive assumption, there are at most 2 d ~ l 
such (x, y) in each of the two sets of 3 rf_1 matchings. This gives a total of at most 
2 x 2 d ~ 1 = 2 d such pairs (x, y). 

The weakness of Theorem |3] that we see here is that all variables get treated 
essentially in the same way. For each y G B, different variables yj might have 
different number of x G A such that (x, y) G R, x% 7^ yj. Theorem|3ljust takes the 
worst case of all of those (the maximum number). Our weight schemes allow to 
allocate weights so that some of load gets moved from variables % which have lots 
of x G A: (x, y) G R, x% 7^ yj to those which have smaller number of such x G A. 
This results in better bounds. 

For the function of section l3~T1 we get 0(2.12.. d ) by old method and Sl(2.5 d ) 
by the new method. For the two functions in section |5j the old method only gives 
bounds that are lower than polynomial degree while the new method shows that 
Q 2 {f) is higher than deg(/) for those functions as well. 
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